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Abstract. We give a formula for the component at p of the p-adic height 
pairing of a divisor of degree on a hyperelliptic curve. We use this to give 
a Chabauty-like method for finding p-adic approximations to p-integral points 
on such curves when the Mordell-Weil rank of the Jacobian equals the genus. 
In this case we get an explicit bound for the number of such p-integral points, 
and we are able to use the method in explicit computation. An important 
aspect of the method is that it only requires a basis of the Mordell-Weil group 
tcnsorcd with O. 



1. Introduction 

Chabauty's method [Cha41], later made effective by Coleman [Col85a], is a fan- 
tastic tool for bounding the number of rational points of a curve X over a number 
field F and for finding p-adic approximations to these points, when the genus g is 
larger than the Mordell-Weil rank of the Jacobian J of X over F. This method has 
been put to effective use in many instances over the last thirty years; for examples 
illustrating this technique, see [MP10] or [Fly97]. Chabauty wrote down p-adic 
functions that vanish on the set of -F-rational points X(F) and Coleman identified 
these functions as p-adic Coleman integrals of holomorphic forms [Col85b] . 

The fascinating recent work of Kim [Kim05, Kim09, KimlOb, CK10, BDCKW12] 
on a non-abelian Chabauty method gives hope that the restriction on the rank 
of J may be removed by using more general iterated Coleman integrals. When 
Kim's method applies and a Coleman function vanishing on X(F) is found, it can 
sometimes be computed explicitly by recent progress on the computation of such 
functions [BBK10, Ball2, BB12]. 

In a recent example for Kim's method [KimlOa], as corrected in [BKK11], an 
explicit Coleman function was given in the case of an elliptic curve E/Q with 
Mordell-Weil rank 1 over Q (satisfying some auxiliary conditions) and shown to 
vanish on all integral points. This function was furthermore explicitly computed 
in [BKK11] and numerically exhibited to vanish on integral points. 

In [BB13], the first two named authors gave an entirely new proof of Kim's 
result, removing, on the way, some of the auxiliary assumptions by showing that 
the function identified by Kim was essentially the component at p of the p-adic 
height and relying on the quadraticity of this height as a function on J and on the 
description of the other components as intersection multiplicities. 
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The goal of the present work is to extend the methods of [BB13] to hyperelliptic 
curves. Suppose that X is a hyperelliptic curve over Q given by the affine equation 

(i-i) y 2 = /(*), 

with / a polynomial of degree 2g + 1 over Z which does not reduce to a square 
modulo any prime number. Let p be a prime, let X. p = X x Q p , and let J be the 
Jacobian of X. We can prove 

Theorem 1.1. If the Mordell-Weil rank of J overQ is exactly g, then there exists a 
Coleman function p on X p and a finite set of values T such that p{U{Jj[l/p\j) C T , 
where U(Z[l/p\) is the set of p-integral solutions to (1.1). If X has good reduction 
at p, then T is effectively computable and p is effectively computable from a basis 
for J(Q) ® Q. 

A more precise version of this result will be given in Theorem 3.1 in Section 3. We 
use this to give an effective bound on the number of integral points in Theorem 6.4. 

We expect that the result above will lead to a practical method for recovering 
all integral points on hyperelliptic curves satisfying the assumptions of the theorem 
(see Remark 3.7). In Example 7.2, we show how carry this out in practice and find 
all integral points with x-coordinate having absolute value less than a prescribed 
bound. 

A different algorithm for the computation of integral points on hyperelliptic 
curves is given in [BMS + 08]. Their algorithm combines linear forms in logarithms 
with a modified version of the Mordell-Weil sieve and does not assume that the rank 
of J(Q) is equal to the genus. However, in contrast to our method, it does require 
generators of the free part of J(Q) which limits its applicability, since currently 
there is no practical algorithm for the saturation of a finite index subgroup of J(Q) 
unless g = 2 [Sto02]; the case g = 3 is currently being worked out [Stol3]. For a 
recent approach to this problem using arithmetic intersection theory, see [Holl2]. 

Our proof relies on the description of the p-adic height pairing given by Cole- 
man and Gross [CG89], the quadraticity of the pairing, and p-adic Arakclov the- 
ory [Bes05]. For any X and F, the height pairing 

h : J(F) x J(F) Q p , 

depending on some auxiliary data, is defined initially for divisors D\ and D 2 of 
degree zero with disjoint support as a sum of local height pairings 

h(D 1 ,D 2 ) = Y J h v {D 1 ,D 2 ) 

V 

over the finite places v of F. As in [BB13], a key point is to remove the disjoint 
support restriction by extending the local height pairings relative to a choice of 
tangent vectors, as suggested in the classical case in [Gro86]. For our hyperelliptic 
curve, we make a certain consistent choice of such tangent vectors. The resulting 
local height pairing gives rise to a function t(x) = h p (x — 00, x — 00) which is 
computed explicitly as a Coleman integral in Theorem 2.1 and is the main summand 
for the function p in Theorem 1.1. The finite set of values T results from the sum 
of the height pairings h q for q / p and is discussed in detail in Proposition 3.3. 

Let us explain the reason for the restriction that X is assumed to have good re- 
duction at p. As our work ultimately relies on p-adic Arakelov theory [Bes05] , which 
has been developed using Vologodsky's integration theory [Vol03], it is insensitive 
to the type of reduction at p. However, Vologodsky's integration has been, so far, 
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difficult to compute, and the computation of Coleman integrals has been, thus far, 
done in the good reduction case. The situation will likely change with [BZ13], which 
will reduce Vologodsky integration on semi-stable curves to Coleman integration. 

We briefly discuss possible applications (see also Remark 3.8). The function p is 
given, on the residue disk of p-adic points reducing to a given point, by a convergent 
power series. It is therefore possible, in reasonable time, to compute all solutions 
of p{x) e T to high p-adic precision. We hope that this technique can be combined 
with upper bounds on the size of the integral points on X [BMS + 08] to provide 
an effective algorithm for determining all integral points on hyperelliptic curves 
satisfying the assumptions of the theorem. 

It is furthermore easy, given some initial coefficients in the power series expan- 
sions making up p, to deduce a bound on the total number of possible solutions 
to p{x) e T, hence on the number of p-integral solutions to (1.1). A fairly crude 
version of this is given in Theorem 6.4. The bound depends on some computations, 
including those of Coleman integrals, on the curve. One might hope for a bound 
which, like [Col85a, (ii) on p. 765], will depend only on some simple numerical data 
of the curve, such as genus and types of bad reduction, but this is unfortunately 
not obvious at this point in time. 

We conclude by giving a number of numerical examples to illustrate our tech- 
niques. 
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2. The local height pairing as a Coleman function 

In this section we let p be a prime number and consider a smooth and proper 
curve C (we will later assume C is hyperelliptic) defined over the algebraic closure 
Q p of the field of p-adic numbers. We fix a point x on C and a tangent vector to 
to C at so- We further fix a decomposition 

(2.1) H 1 dR (C) = W(Sn 1 (C) 

such that W is isotropic with respect to the cup product and a branch of the p-adic 
logarithm which we denote by log. Given this data, the function hj- on the tangent 
bundle T to C outside of the point xq was defined in [BB13]. If G is the Green 
function associated to the data above, as defined in [Bes05], then hj- is defined as 

h T (x,t) = G x (x,t) - 2G Xa (x) + G Xo (x ,t ) , 

where G y {x) is by definition G{y,x) and the notation G x (x,t) stands for the con- 
stant term of G x at x with respect to a local parameter z normalized with respect 
to t in the sense that dt(z) = 1. This function is related to the local height pairing 
as follows: If all data are defined over a finite extension K of Q p , then the value 
hj-(x,t) lies in K and the local height, evaluated on the divisor (x) — (x ) with 
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choices of tangent vectors t and to, is exactly the trace from K to Q p of h-r(x,t), 
where the trace map is part of the data used to define the height pairing. 

Suppose now that C is a hyperelliptic curve defined by the equation y 2 = f(x) 
with / a polynomial of degree 2g + 1. We have a basis {uii}^ 1 of the de Rham 
cohomology H^ R (C/Q p ) given by the forms of the second kind u>i — x l dx/2y. The 
form u>i has order 2g — 2 — 2i at oo, and it has no pole away from oo. In particular, 
the forms u>i for i < g — 1 are holomorphic. We let lo[ = u>2g-i—i for < i < g — 1, 
so that uj[ has a pole of order 2(g — i) at oo. As in the introduction, we fix ui = luq. 
This form vanishes to order 2g — 2 at infinity and has no other zeros or poles. It 
determines, by duality, a section t of T with a pole of order 2g — 2 at infinity and 
no other zeros or poles. On the other hand, the form u> g -i has no zero or pole 
at infinity and we fix to to be the dual of its value there. We pick a parameter 
z = —y/(x 9+1 f2g+i) at infinity, where /2g+i is the leading coefficient of /. 

Let f cv (x) denote the polynomial x 2 9 +2 ,f{l/x) e Z[x\. Using [BB12, (10)] as 
well as the description at infinity of the u>i following (11) there, and noting that the 
parameter t there is — zf 2 g +i, we find a local equation at oo: 

Differentiating, we find that 

ds ds 



2* 2f 2g +iz 

and 



= (l + 0(z))dz 



Ui = s 9 - l - l Yt = (f2 9+ iz 2 ) g - l - l {l + 0{z))dz . 

In particular tu g -i = (1 + 0(z))dz, and one easily finds that z is normalized with 
respect to to- Let us also record then the local expansion 

(2.2) x = 8~ 1 =/ 2 - 1 +1 ^- 2 (l + •••)• 

As the oji only have poles at oo, the formula for the cup product in terms of integrals 
and residues gives 

[w'j] U [oji] = Resoo (^uji j uj'^j . 
Clearly, when g — 1 > j > i the form / uj'j is holomorphic at oo, while 

= f&r^-'-Hi + °^)^B§T-i z2 ^ 9)+1{l + 0{z))dz 



(l + 0(z)). 



zf 2g+ i(2(i-g) + l) 



We find 



(2-3) K'] U [Ui] = /ag+i(2 . + 1 _ 2g) , Kl U N = if g- 1 > j > i . 

Let {iOi}f~Q be the basis for W which is dual to the basis {wj}?~ for the holo- 
morphic forms via the cup product. From (2.3) it follows that 
(2-4) 

&i = f2 g +\{2i + 1 — 2g)oj i + combination of lu'j for j > i + a holomorphic form. 

Recall [BdJ12, Definition 7.7] that the constant term of a Coleman integral with 
respect to a parameter z at a point is the coefficient of 1 in its expansion which 
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includes both powers of z and non- negative powers of log(z). We are going to 
normalize our integrals in such a way that the constant term with respect to the 
parameter z at oo is 0. We will write J t (recall that z is normalized with respect to 
to) for this type of integral, which is the case when all integrands have logarithmic 
singularities at oo [BF06, Proposition 2.11] (with worse singularities, integration 
from a tangential basepoint makes no sense). 

Theorem 2.1. Let r be the pullback of hf under t. Then we have 




-(5-l)log(/ 2fl+ i) • 



Proof. According to [Bes05, Proposition 2.7], we have that r, being the pullback of 
a log function by a section, is a Coleman function on the complement U of infinity 
in C and if 8 denotes the p-adic del bar operator, then ddr is the pullback of the 
curvature of the restriction of hf to U under t(x). But the curvature on hf is 
-2^]-r o u>i (g> uji by [BB13, Proposition 3.10], and we obtain 

9-1 

dr = ^ / Oji + 6 

i=o 

where 8 is a holomorphic form on U and the integrals are indefinite. There is the 
indeterminancy of the constant of integration and of the form 8. Note that the 
constant of integration for the u)j simply adds to 9 holomorphic forms on C . Thus 
we have 

rx / f- 1 ry \ r 

t(x) + 2 / Y^uji / ut = / 6' 

J to \i=0 Jt « J J 

where 8' is holomorphic on U . We would like to show that the right hand side 
vanishes. 

First we argue that the form 8' extends to a holomorphic form on C . To see this 
we observe that it follows from (2.4) that uJi(J u>i) has a simple pole at infinity and 
no other pole. On the other hand, since w has no poles or zeros outside infinity, 
t(x) as a section of T also has no such zeros and poles outside infinity. It follows 
that the form dr also has a simple pole at infinity and no other pole. Indeed, 
by [Bes05, Definition 4.1] and the ensuing remarks, if s is a non- vanishing section 
of T, then locally for the analytic topology, r looks like log^s) + dlog(t/s) with 
log-j-(s) analytic. Thus, 9' has a simple pole at infinity, but since it is meromorphic, 
the residue theorem implies that it is in fact holomorphic on C (one can directly 
compute the residue at infinity and easily discover that it is 0). 

Next we claim that 8 = 0. This is because both r and the integral on the right 
hand side of the theorem are symmetric with respect to the hypcrclliptic involution 
w. For t this follows from functoriality of the height pairing similar to the way it 
is proved in [BB13]. For the integral, we first note that all forms considered are 
anti-symmetric with respect to w. Then the integral of a u>i is anti-symmetric up 
to a constant, and this is because the constant term with respect to a parameter 
which is itself anti-symmetric is 0. Thus, all terms u>i tDj are symmetric, and 
their integral is so as well. Consequently, 8 is a holomorphic symmetric form on C, 
hence 0. 

It follows that the equality in the statement of the theorem holds up to a constant. 
It suffices to show that the constant term of r at infinity is — (g — 1) log(/2 S +i). 
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With r replaced by the pullback of hf under the dual of the constant term is 

by [BB13, Proposition 3.10], since 0J g -i is dual to the non- vanishing to at infinity. 
However, this pullback differs from r by log(a; 5_1 ) = (g — 1) log(:r). Using (2.2) we 
find 

log(a;) = — 21og(z) — log(/ 2ff +i) + a power series vanishing at 0. 
The constant term of x with respect to z is therefore — log(/ 2g +i) and that of r is 
-(fl-l)log(/ 2fl+ i). ' □ 

3. Application to p-integral points 

The results of the previous section may be used, in a similar way to the results 
in [BB13], to obtain a p-adic characterization of p- integral points on hyperelliptic 
curves, a kind of "quadratic Chabauty." Let / G Z[x] be a separable polynomial of 
degree 2g + 1 > 3 such that / does not reduce to a square modulo q for any prime 
number q. 

Let U = Spec(Z[a;, y)/(y 2 — f(x))), so that U(Z[l/p]) is exactly the set of p- 
intcgral solutions to y 2 = f(x). Let X be the normalization of the closure of the 
generic fiber of U, and let J be its Jacobian. 

Theorem 3.1. Let f i} for i = 0, . . . , g — 1, be defined by the formula 

fi(z) = / Wj, 
J oo 

and let gij — gji — fi ■ fj for i < j. Suppose that the Mordell- Weil rank of J 
is exactly g. Then either there exists a linear combination of the fi that vanishes 
on U{1i[llp\) (in other words, Chabauty 's method works, at least for the p-integral 
points ), or there exist constants onj G Q p such that the function 

(3.1) p{z) = t{z) - a H9ij (z) 

satisfies the following: 

(1) There is an effectively computable constant s G Q p such that we have p{x) = 
s for every x G U{7L\Yjp\) which does not intersect any of the singular points 
in any of the bad fibers oflA. If f is monic, then we have s = 0. 

(2) There exists an effectively computable finite set of values T containing 
p(W(Z[l/p])). 

Proof. We recall that the height pairing depends on the choice of a Q p -valued idele 
class character £ — (B( q , a sum over all finite primes q. Fixing a choice for the 
p-adic logarithm, which we simply denote by log, we can choose i in such a way 
that £ p = log. In this case, the trace map mentioned in the introduction is simply 
the identity and the local height pairing h p is simply the pairing mentioned there, 
so that h p ((x) — (oo), (x) — (oo)), normalized as before, is simply t(x). 

If Chabauty's method does not work, the fi, extended linearly, induce linearly 
independent Q p -valued functionals on the ^-dimensional vector space J(Q) ® Q- 
It follows that the basis for the space of Q p -valued quadratic forms on 

J(Q) <g> Q. The height pairing h, being another Q p -valued quadratic form on V, 
is thus a linear combination of the above basis, h = otaf/a ■ If x G X(Q), then, 
applying the last formula to the divisor (x) — (oo), Theorem 2.1 implies that 

(3.2) p(x) = - ^ h q ((x) - (oo), (x) - (oo)) . 
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Note that the local heights h q are computed using the localization of the same 
global tangent vectors as before. According to Proposition 3.3 below, for each 
prime number q^p there is an effectively computable p-adic number s q such that 
when x is a p- integral point and either the reduction at q is good, or, more generally, 
the reduction is bad but x does not reduce to a singular point modulo q, we have 

h q ((x) - (oo), (x) - (oo)) = v 

See (5.1) for an explicit expression for s q . If q does not divide the leading coefficient 
/23+1 of / then s q — by Proposition 3.3. 

In general, Proposition 3.3 implies that there is a proper regular model X of 
X x Q g over Z g such that if x is p-integral, then h q ((x) — (00), (x) — (00)) depends 
solely (and explicitly, see Section 5) on the component of the special fiber X q that 
the section in X(Z q ) corresponding to x intersects. Thus, for any p-integral point x, 
the right hand side of (3.2) can only take a finite number of explicitly computable 
values, completing the proof. □ 

Remark 3.2. From the proof it is clear that the size of the set T is bounded by 
1 + n g ( m 9 — 1)> where m q is the number of multiplicity one components of the 
special fiber X q of the proper regular model X of X constructed below, as these are 
the only components through which the section corresponding to x can pass. 

We construct a normal model X' of X x Q q over Spec(Z g ) as follows: If F(X, Z) 
is the degree 2g + 2-homogenization of /, then the equation 

Y 2 = F(X, Z) 

gives a smooth plane projective model of C in projective space Pq (1, g + 1, 1) over 
Q q with respective weights 1,5 + 1 and 1 assigned to the variables X, Y, Z. Let 
X' be the scheme defined by the same equation in P| (1, g + 1, 1). We call X' the 
Zariski closure of X x Q q over Spec(Z 9 ). 

By [Liu02, Corollary 8.3.51] , there exists a proper regular model X of X x Q q over 
Z q , together with a proper birational morphism <j> : X — > X' that is an isomorphism 
outside the singular locus of X' . Such a model is called a desingularization in the 
strong sense of X' . If y is a Q 9 -rational point on X, then, by abuse of notation, we 
also denote the corresponding section y G X(Z q ). Our assumptions on / guarantee 
that there is a unique component To of the special fiber of X which dominates the 
special fiber of X' , since X' is normal with an irreducible and reduced special fiber. 
Then any y € X(Q) q ) whose reduction modulo q is nonsingular has the property 
that the section y intersects To- 

Proposition 3.3. Let X be as above and let x be a p-integral point. The value of 
h q ((x) — (00), (x) — (00)) depends only on the component T x of the fiber of X above 
q that x passes through and is effectively computable from T x . Moreover, this value 
is ifT x = r and q does not divide fig+i ■ 

Proof. Let us first recall how h q ((x) — (00), (x) — (00)) is computed [Gro86, Sec- 
tion 5]. Let ( . ) denote the rational- valued intersection multiplicity on X. Accord- 
ing to [Lan88, Theorem III. 3. 6], there is a vertical Q-divisor $((2:) — (00)) on X with 
rational coefficients such that D x = x — 00 + Q((x) — (00)) satisfies (D x . Y) = for 
all vertical divisors V e Div(A'). The local height pairing h q ((x) — (00), (x) — (00)) 
is then given by 

(3.3) h q ((x) - M,(x) - (00)) = -(D x .D x )\ogq . 
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Up to addition of a rational multiple of the entire special fiber X q , which is irrel- 
evant for intersections with vertical divisors, $((x) — (oo)) only depends on which 
components the sections x and oo pass through, see also Section 5. Note that the 
intersection (D x .D x ) is equal to 

(x-oo) 2 + $((x)-(oo)) 2 . 

One computes the first intersection using the following rule for self-intersection of 
horizontal components: Let x G X(Z q ), and let t be the chosen tangent vector at 
x. Then: 

• If t is a generator to the tangent bundle at x, then (x . x) = 0. 

• More generally, if at is a generator, with a G Q q , then (x . x) = —v q (a), 
where v q is the g-adic valuation (see 3 of Definition 2.3 in [BB13]). We note that 
the p-intcgrality of x implies that (x . oo) = 0. 

In our case the tangent vector at x is determined as the dual to the value of the 
differential form uj at that point. Lemma 3.4 (ii) below implies that the intersection 
multiplicity (x.x) depends only on r x . This proves the first statement of the 
proposition. 

Now suppose that x intersects Lo, then (x . x) = follows from Lemma 3.4 (iii). 
If we assume, in addition, that q docs not divide fig+\, then oo reduces to a 
nonsingular point and hence the corresponding section intersects r . Since x and 
oo pass through the same component, it is clear that we can take $((x) — (oo)) = 0. 
Finally, Lemma 3.4 (iv) implies that (oo . oo) =0. □ 

Lemma 3.4. Let X and lj be as above and write the divisor div(w) G Div(A') as 
div(w) = H + V, where H is horizontal and V is vertical. Let x G X(Q q ). 

(i) We have H = (2g - 2) • oo. 

(ii) For the intersection multiplicity, as normalized by our chosen tangent vec- 
tors, we have (x.x) = —(x. div(w)). Ln particular, (x.x) — —(x.V) if 
x eU(Z q ). 

(iii) The component Tq is not contained in V . 

(iv) We have (oo . oo) = — (oo.W), where W is the vertical part o/div(cj g _i) 
and the self-intersection is taken with respect to t . If q does not divide 
f2g+i, then this is equal to 0. 

Proof. The proof of (i) is obvious. We now turn to the proof of (ii). Using cotangent 
vectors, the formula for the self-intersection (x.x) is as follows: if auj(x) is a 
generator for the cotangent bundle at x, with a G Q p , then (x.x) = v q (a). But this 
is exactly — (x . div(w)). Indeed, for any line bundle C on X and any meromorphic 
section s of C that has no zeros or poles at x on the generic fiber, the pullback x*C 
is a free Z g -modulc of rank 1 and s(x) is a non-zero element of x* C <£> Q q . When 
s(x) e x* C and a ■ s(x) is a generator for x* C we have 

V q( a ) = -\ g q #( X *£/ S ( X )%q) 

and the right hand side is well-known to be equal to — (div(s) . x). The behavior of 
intersection with a principal divisor implies that this continues to hold without the 
assumption s(x) G x* C. 

When x G U(Z[l/p\) we know that (x . oo) = 0, so using (i) we get (x.x) = 
-(x.V). 

For (iii), note that on the integral affine subscheme WxZ g oi X', both the relative 
cotangent bundle and the relative dualizing sheaf over Spec(Z 9 ) are generated by 
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u\uy.i q , see [Liu02, §6.4]. Hence t generates the tangent bundle on U. But since X' 
and X are isomorphic outside the singular locus of X' by assumption, (ii) implies 

= (x.x) = -(V.x) 

for any Q ? -rational point x such that x does not reduce to oo or a singular point 
modulo q. So div(cj) cannot contain r , since all such points reduce to r . 

The proof of the first part of (iv) is analogous to the proof of (ii). If q does not 
divide /2g+i, then oo is nonsingular modulo q. The form uj g -i is just —du/2v on the 
afBne patch of X given by v 2 = f TCV (u), where u = 1/x and v = y/x 9+1 . So (iv) fol- 
lows in amanner similar to (hi) because du/2v generates ^spec(z,[«,«]/^-/«v(„))/s P ec(z 5 )- 

□ 

Remark 3.5. If g = 1 and the given equation of C is a minimal Weierstrass 
equation, then we can take X to be the minimal regular model of C. In this case 
it is easy to show that div(w) = 0. 

Remark 3.6. The computation of the function p requires the following ingredients: 

• Coleman integration, including iterated Coleman integrals [Ball 2], for the 
computation of t; 

• computation of p-adic height pairings [BB12, BMS12]; 

• a basis for the Mordell-Weil group of J tensorcd with Q. Typically one 
computes this by searching for points of small height in J(Q) (or differences 
of rational points on X) until we have found g independent points. Note 
that we hrst need to verify that the rank of J(Q) is indeed g, for instance 
using 2-descent on J, cf. [StoOl]. 

Clearly, computing the height pairings between all pairs of elements of the above 
basis, together with the computation of the integrals of the on elements of this 
basis, suffice for the determination of the constants ay. Integrals of holomorphic 
forms give the fi's, hence the gy's, and iterated Coleman integrals give r. Note that 
one can get bounds on the number of integral points without computing iterated 
integrals, see Remark 6.5. 

Remark 3.7. Here we describe how to use these ideas to give an algorithm to find 
integral points on a genus g hyperelliptic curve X with Mordell-Weil rank g: 

(1) Let D\,...,D g G Div°(X) be representatives of the elements of a basis for 
the Mordell-Weil group of the Jacobian tensored with Q. We compute the 
global p-adic height pairings h(D\, Di), h(D\, -D2), ■ • • , h(D gi D g ) and the 
g ( g + 1 ) x g ^ g 2 f 1 - ) matrix of the Coleman integrals j D ujk J D uii for 1 < i < 
j < g and < k < I < g — 1. Solving a linear system (see Example 7.2) 
gives the vector of ay values in (3.1). 

(2) Compute the finite set T of possible values that p can take on p-integral 
points. 

(3) Compute the value of r(P) for some point P on the curve. Use (4.1) and 
the power series expansions of the double and single Coleman integrals to 
give a power series describing r(t) in each residue disk. 

(4) The integral points are solutions to r(t) — J2 a ijfi(t)fj(t) = b across the 
various residue disks, where b runs through the elements of T. 

(5) Use bounds on the size of integral points to determine precision and prov- 
ably terminate the algorithm. 
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We show how to carry out Steps (l)-(4) in practice in Section 7. 

Remark 3.8. It would be interesting to try to use height pairings to find integral 
points for number fields other than Q. The problem is that in this case the p-adic 
manifold obtained by tensoring with Q p has dimension > 1 while the height pairing 
still provides only one equation. A possible solution is to use more than one idele 
class character. Let us describe one simple situation where this can be made to 
work. 

Suppose L is an imaginary quadratic extension with class number 1 where the 
prime p remains inert. Since the group of units is torsion, one can lift any £ p : 
Lp — > Q P to an idele class character: A uniformizer at the (principal) prime ideal 
(a) will be sent to — £ p (a). Composing the p-adic log with a basis of the Q p -linear 
functionals L p — > <Q> p , we get two different £ p , hence two idele class characters, from 
which we get two p-adic equations, and we may expect these to carve out a finite 
number of solutions in integral L p -points. 

4. Computation of t 

To obtain numerical examples, we need to compute the function p from (3.1). 
This function involves height function computations for the determination of the 
constants appearing there, for which we have the algorithm of [BB12, BMS12], 
and the computation of Coleman integrals of holomorphic forms, which can be 
performed using the algorithms of [BBK10]. 

The remaining component is the computation of the function r, which is an 
iterated Coleman integral [Ball2]. In this section, we discuss how to leverage in- 
formation about local p-adic height pairings to compute iterated integrals from 
tangential points as an alternative to direct computation of these integrals. 

First, when r is to be evaluated at a Weierstrass point, we extend the result of 
[BB13, Proposition 6.1] to hyperelliptic curves: 

Lemma 4.1. Let P = (A,0) be a Weierstrass point. Then we have 

T(P) = i(log(/'(A)) + log(/ 23+1 )). 

Proof. By the properties of the local height pairing, the value of t at P is 1/2 of 
the normalized value of x — A on the divisor (P) — (oo). As shown in [BB13, Propo- 
sition 6.1], the normalized value at P is log(/'(A)), while by (4.3) the normalized 
value at oo is Z^+i- D 

In general, one way to avoid directly computing iterated integrals from tangential 
points is to compute the iterated integral with one specific endpoint 1 P (chosen in 
a certain favorable way) in some indirect way. The integral for a general point can 
then be computed from this (see (4.1) below). For example, in the case of elliptic 
curves, in [BKK11] the point P was taken to be a global two-torsion point or a 
tangential point at infinity, while in [BB13] a (non-global) 2 or 3-torsion point was 
used. 

In the hyperelliptic curve case, we can also integrate from finite Weierstrass 
points, but this requires working over totally ramified extensions of Q p , which is, 
in practice, quite slow. As an alternative approach, we describe a technique for 



In this section we will be using the coordinate functions extensively. We therefore denote, 
unlike in previous sections, points on the curve by P, Q, etc. 
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computing t(P) for a general point P that does not use the description of r in 
terms of iterated integrals in Theorem 2.1 but instead works directly with the 
description of r as a local height pairing. Given t(P), the value of r at any other 
point P 1 may be computed using the following formula. 

(4.1) 




where we are using a decomposition of iterated integrals which may be found, for 
example, in [BKK11, p. 288]. 

We note that for applications, the description of r in terms of local power series 
expansions is vital. Thus, while we are able to compute t(P') for any given P', 
we will be using (4.1) after computing t(P) for one, or possibly a finite number, of 
values of P, as this last formula does in fact give such a power series expansion. 

The strategy for computing t(P) for a particular point P in a residue disk is 
to interpret it as a normalized local height and to directly compute this using an 
extension of the techniques of [BB12]. 

To this end, it is first of all somewhat helpful to use cotangent vectors rather 
than tangent vectors for the normalization. Let u denote a choice, for each point on 
the curve C, of a cotangent vector at that point. This could obviously come from 
a differential form but could also be just an arbitrary assignment. We can write 
h\ oc {Di,D2)u, for the local height pairing of D\ and D 2 , computed with respect to 
the tangent vectors dual to the cotangent vectors specified in u>. This notation has 
the advantage that if / is an isomorphism of curves, one has, by an easy argument 

hioc(f*D 1 ,f*D 2 ) f , 0J = h oc (Di,D 2 ) u 

where f*cj means the pointwise pullback of the cotangent vectors with respect to the 
differential of / at a point (this coincides with the usual pullback for holomorphic 
forms). Another important observation is that 

hi oc (D 1 ,D 2 )- 0J = hi oc (D 1 ,D 2 ) 0J , 

because of the property [BB13, Definition 2.3] of the local height pairing. For the 
hyperelliptic case, the assignment uj corresponds to one form away from infinity, 
and to the value of another form there, but both are anti-symmetric with respect 
to the hyperelliptic involution w, so with that choice of w we have 

h\ oc (w*Di,w*D 2 )u> = /iioc(-Di,£>2) w ■ 

Thus, with this choice of tangent vectors, the splitting into symmetric and anti- 
symmetric components of [BB12] extends without any difficulty. In particular, we 
can use formula (19) there. Using the shorthand ft-ioc(-D)w for h\ oc (D, D)^, and 
taking D = (P) — (oo), we have 
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T(P) = h oc ({P) - (oo))„ 

= \h loc ((P) (00) + W«P) - (00))) w + ^loc((P) - (00) - ttf((P) - (00))) u 
= h loc ((P) + (w(P)) - 2(00)) u + i/lloc(P - (w(P))) w • 

Suppose now that P = (a, 6) is a point with non-Weierstrass reduction. We 
compute h\ oc ((P) + (w(P)) ~ 2(oo)) w . Since (P) + (w(P)) — 2(oo) is the divisor 
of the function x — a, this local term is log on the normalized value of x — a on 
(P) + (w(Pj) — 2(oo). To compute this, we note that a normalized parameter with 
respect to to at P is 

. „ i x — a 

and so the normalized value (x — a) [P] is just 2b. Similarly, the normalized value at 
w(P) is —26. On the other hand, using the normalized parameter — y / (f2g+i% 9+1 ) 
with respect to t at infinity, the normalized value of x — a at infinity is 

(4.3) lim (x a) = }™J X *) ^^1+2 = ^+1 

recalling that / has leading coefficient fi g +i- Collecting all the data and us- 
ing [BB13, Definition 2.3] we find 

hU(P) + MP)) - 2(oo))„ - log(-4& 2 / 2 2 s+1 ) = log(46 2 ) + 21og(/ 2s+1 ) . 

Next we compute the anti-symmetric part, hi oc ((P) — (w(P))) u . By [BBf3, 
Proposition 3.4], the local height pairing for two divisors (which are not necessarily 
disjoint) is given by 

h loc (D 1 ,D2) u = G Dl [D 2 ] 

where, by [Bes05, Theorem 7.3], if the degree of D\ is 0, the Green function Gd x 
coincides with with the Coleman integral J uid 1 ■ Here is, as in [BBf 2], a certain 
form of the third kind whose residue divisor is Di , and the value of this integral on 
D 2 is normalized by taking constant terms with respect to local parameters which 
are normalized with respect to the chosen cotangent vectors. 

In our case we have D\ = D 2 = D = (P) — (oo). As in [BBf 2, Algorithm 5.8] 
we can decompose cjd as v — 77, where v is an arbitrary form of the third kind with 
residue divisor D while rj is holomorphic and is computed from v and from the 
decomposition (2.1) as in [BBI2, Subsection 5.6]. We consequently have 



h loc (P - (w(P))) u = [ v- I n 

Jw(P) Jw(P) 



'w(P) Jw{P) 

The integral of 77 can be computed using the techniques in [BBKIO]. The integral 
Jw(p) v nas to k° normalized since v has poles at the endpoints P and w(P). To 
properly account for this normalization, we use an auxiliary point Q in the same 
residue disk as P. 

More concretely, by [BBI2, Proposition 5.13] we may take v — y ^ a) , which is 
anti-symmetric. Using this and breaking the path of integration into several pieces, 
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we obtain 

r P r Q r P 



rf rf 
\ V = / U+ I V = I V + / V, 

Jw{P) Jw(P) JQ Jw(Q) JQ 





rQ 






/ " + 




IQ 


Jw(Q) 


JQ 



rQ rQ 
= -2 / v + \ v . 

JP Jw{Q) 

The first integral is a tiny integral, but again, since it has a pole at P, this has 
to be normalized with respect to the chosen cotangent vector, which we do below. 
The second integral was considered in [BB12, Algorithm 4.8]; here we make a small 
modification, also discussed below, to handle a subsequent part of the computation 
(that of a tiny integral with a pole within the disk of integration). 

4.1. Computing Jp v. Let z be a parameter at P, e.g., (4.2), which is normalized 
with respect to u>. Writing v in terms of z we get v = (z^ 1 + a$ + a\z + ■ ■ ■ )dz, 
since v has a simple pole with residue 1 at P. The normalized integral Jp v is 
the normalized integral of v evaluated at Q. The normalization means that the 
constant term with respect to z of the integral is 0, i.e., that it is of the form 
log(z) + a z + aiz 2 /2 + • • • . If we use the parameter t = (x — a) = 2bz instead 
we can rewrite this as log(£) — log(26)+ some power series in t, where this power 
series is nothing but the term- by-term integral of v — t~ 1 dt. Thus, in terms of the 
parameter t, the formula for the integral is 

rQ rt(Q) 
/ v = log(t(Q)) - log(26) + / {y - t~ x d£). 
Jp Jo 

4.2. Computing J® v. By [BB12, (14)], we know 



(4.4) 



)w(Q) 

rQ 



f v=-^— [*(<*) U *(/?) + VRes A (a U\ -2 f 
Mq) 1 ~P V Its \ J J JQ 




where a = — pv (a form constructed via a p-power lift of Frobenius <f), S is the 
set of closed points, (3 has residue divisor Q — w(Q), and <f is a logarithm map. 

Each of the quantities in (4.4) can be computed using the techniques in [BB12], 
except for Res^g.s (a J /3) , since /? has poles in certain residue disks which, by 
construction, are disks which contain points in S - i.e., disks where the integration 
will take place. Here we give an elementary lemma which allows us to extend the 
techniques of [BB12] to handle this case. 

Lemma 4.2. Suppose j3 has residue divisor Q — wQ, where Q is assumed to be 
non- Weierstrass. The integral of (5 computed between points P, P' distinct from Q 
but contained in the residue disk of Q, written as 

[ P ' = f f(x(Q))-m , f x(P')-x(Q) \ 
Jp P Jp y(x-x(Q))(y(Q)+y) ' + V x(P) - x{Q) J 
converges. 
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Proof. Since P, P' are in the same residue disk, we compute a locally analytic 
parametrization (x(t),y(t)) from P to P' and use this to rewrite: 

y(Q)dx 



y(x - x(Q)) 
p ' y{Q)dx (I 1 \ f p ' dx 



p 



X (Q) \y y(Q)J Jp x-x(Q) 



-I 



f(x(Q)) - f(x(t)) dx{t)+log f<n-<Q) 



Jo y(t)(x(t)-x(Q))(y(Q)+y(t)) w *\x(P)-x(Q) 

It remains to check the convergence of the integrand. Since Q is non-Weierstrass, 
both y{t) and y(Q) + y(t) are units in Z p [[£]]. 

The claim is then that v p ^^^ft^-x^Q)^ ) — ^- Indeed, since x(Q) — x(t) divides 

f(x(Q)) - f(x(t)), we see that v p ('^p^^) is simply the p-adic valuation of 
the linear coefficient of f(x), which is assumed to be integral. 

□ 



5. Computing all possible values of p on U(Z[l/p\) 

Let (j be a prime and let <f> : X — > X' denote a desingularization in the strong 
sense of the Zariski closure X' of C := X x Q q over Spec(Z g ). 

For D G Div(C) we also write, by abuse of notation, D for the Zariski closure 
(with multiplicities) of D in Div(X). If D £ Div (C), then there exists a vertical 
Q-divisor $ q {D) E Div(X) ® Q such that D + $ q (D) has trivial intersection multi- 
plicities with all vertical divisors on X, see [Lan88, Theorem III. 3. 6]. Since & q (D) 
is vertical itself, we have 

(*,(£)).£>) = -$ q (D) 2 > 0. 

From now on, we will assume that X is a desingularization in the strong sense of 
the Zariski closure X' of X x Q q over Z q . It follows from the proof of Theorem 3.1, 
from (3.3) and from Lemma 3.4 that a point x G U(Z[l/p\) satisfies 

P( x ) = - E (^(O*) - (°°)) 2 + ( x • V ^ + (°° • W «)) l0 S9 ' 

where V q is the vertical part of div(w) and W q is the vertical part of div(u; g _i) over 
q. Note that only bad primes can contribute toward the sum on the right hand side. 
By Lemma 3.4, the p-adic constant s q from the proof of Theorem 3.1 is therefore 
equal to 

(5.1) Sq = (*,((*) - M) 2 + ((X) . W q )) \og(q) . 

5.1. Computing local contributions. In this subsection we discuss how the 
quantity 

(5.2) ^ q {{x)-{oo)) 2 + {x.V q ) + (oo.W q ) 

can be computed for a given prime q and x G U(Z q ). Let M q = {mij)ij denote 
the intersection matrix of the special fiber X q = ^aTi. Its entries are given by 
rriij = {aiTi .djTj). For simplicity, we drop the subscript q in the following, as we 
will work over a fixed prime q. 
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The matrix M has rank n — 1, where n is the number of irreducible components 
of X q , and its kernel is spanned by the vector (1, . . . , 1) T , see [Lan88, § III. 3]. Let 
M + denote the Moore-Penrose pseudoinverse of M and let u(x) denote the column 
vector whose ith entry is (x — oo . cuTi). Then we have 

(5.3) <f>((x) - (oo)) 2 = u(x) T M+u{x) . 

We sketch a different method to compute $((#) — (oo)) 2 . By [CR93] one can 
associate a metrized graph (the reduction graph) G to the special fiber X q such 
that if X q is semistable, then we can take G to be the dual graph of X q , suitably 
metrized. 

Let T x and denote the component oiX q that x and oo reduce to, respectively. 
Then we have 

$((*)- (oo)) 2 = -r(T x ,T oc ), 
where r(r a; ,P 00 ) is the resistance between the nodes T x and if we consider the 
graph G as a resistive electric circuit such that the resistance of each edge is equal 
to its length. Using work from [BGX03], there is a purely graph-theoretic formula 
for the resistance when all edges have length 1, which is the case for semistable X q . 

Now we discuss the computation of (x . V). Recall that div(w) = H + V, where 
H = (2g — 2) • oo G Div(X) is horizontal and V is vertical. We call a Q-divisor K, 
on A" a canonical Q-divisor on X if O(IC) = u> x / spcc(z q ) , where Ux/ Spcc(z,) is the 
relative dualizing sheaf of X over Spec(Z 9 ). 

Since div(w)|x = (2g — 2) ■ (oo) is a canonical divisor on X, it follows from [CK09, 
Proposition 2.5] that we can extend if to a canonical Q-divisor K, = H + V' on X 
if V' is a vertical Q-divisor such that JC satisfies the adjunction formula 

(5.4) (/c.r) = -r 2 + 2 Pa (r) -2 

for all components T of the special fiber of X, where p a denotes the arithmetic genus. 
Such an extension always exists (see [KM12, §2] for a more general statement), but 
it is not unique, because the validity of (5.4) is unchanged if we add a multiple of 
the entire special fiber X q to V . Under the additional condition that V does not 
contain the component T , the Q-divisor V is uniquely determined. Recall that T 
is the unique component of the special fiber of X which dominates the special fiber 
of X'. 

It is not true in general that div(w) is a canonical divisor on X, since u> x / spcc(Zq) 
may differ from the relative cotangent bundle ^^y Spcc ( Z )• But restricting to the 
smooth locus X sm , we have 

Ux/Spec(l, Q )\xs™ - ^A7Spcc(Z„)U sm 

by [Liu02, Corollary 6.4.13]. Hence V — V is supported in the vertical divisors of 
multiplicity at least 2, which immediately implies 

(x.V) = (x.V). 

Therefore it suffices to compute V'. The adjunction formula (5.4) and the constraint 
r ^ supp(U') reduce the computation of V to a system of linear equations which 
has a unique solution by the above discussion. 

The computation of (oo . div(io s _i)) is analogous: We find a vertical Q-divisor 
W such that adding W to the horizontal part of div(a; ff _i) gives a canonical Q- 
divisor which does not contain r in its support. Then we know that 

(oo . div(wg-i)) = (oo . W) = (oo . W). 
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5.2. Local contributions for p in genus 2. Keeping the notation of the previous 
subsection, we now assume, in addition, that q does not divide ,f2 g +i- In this 
situation one can compute all possible values of 

$((x) - (oo)) 2 + (x . V) 

if the genus of X is 2 and X q has semistable reduction purely from the reduction 
type. 

The possible special fibers of minimal regular models of curves of genus 2 have 
been classified by Namikawa-Ueno [NU73]. In particular, it is known that if X is 
semistable, then the reduction types of X are either [I ni — J„ 2 — m] or [7„ 1 _„ 2 _„ 3 ], 
where m,n\, ri2,Ti3 > are integers. 

In Table 1 we list the stable model corresponding to each reduction type and 
the relation to the discriminant A of a minimal Weierstrass model of X. Because 



reduction type 


stable reduction 


v g (A) 


[Jo-o-o] 


smooth curve of genus 2 





[Jm-O-o] 


genus 1 curve with a unique node 


m 


[Ini — "2— o] 


genus curve with exactly 2 nodes 


m + n 2 


[In 1 —n2—n 3 ] 


union of 2 genus curves, 
intersecting in 3 points 


ni + n,2 + m 


[J - J - m] 


union of two smooth genus 1 curves, 
intersecting in 1 point 


12m 


[I n - J - TO] 


union of a smooth genus 1 curve and a 
genus curve with a unique node, 
intersecting in 1 point 


12m + ni 


[I ni - J«2 - TO] 


union of 2 genus curves with a unique 
node, intersecting in 1 point 


12m + n\ + n 2 



Table 1. Semistable reduction types in genus 2 



of our assumptions on /, the reduction type [J ni _„ 2 _„ 3 ] cannot occur, since the 
reduction of / modulo q would have to be a square. 

We now list all possible values &((x) — (oo)) 2 , where x G X(Q q ). We also list 
all possible values (x . V) and the possible sums &((x) — (oo)) 2 + (x.V). In the 
present case of semistable reduction, we have V = V in the notation of the previous 
subsection. 

First we consider reduction type [J ni _„ 2 _o], where ni,ri2 > 0. It is easy to see 
that we have V = V = 0. The set of all possible values for <S>((x) — (oo)) 2 is 

{-^} U {-^^} , < i < LW2J, < j < L n 2 /2j, 

see [MS13, §6]. The possible values for reduction type [J ni — J„ 2 — m], where 
ni,7i2,m > 0, depend on where oo intersects the special fiber. This information 
can be obtained easily from the equation of X. 

If oo intersects a component corresponding to I ni , then it is easy to see that 
(x.V) € {0, —1, ... , — m}. Moreover, we have 

(5.5) *((*) - (oo)) 2 e U -L^-Jl t - m - M-^) j j 

[ m n 2 J 
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where i G {0, . . . , m}, j G {0, . . . , Lrz-i / 2J } and fc G {0, . . . , L n 2/2J }. If oo intersects 
a component corresponding to I n2 , then we get all possible values upon swapping 
both n\ and n 2 and j and fc in (5.5) and (5.6). 

Remark 5.1. If we have two singular points in the reduction of X modulo q 
which are conjugate over F 9 2, then we know that x and oo intersect T and hence 



Remark 5.2. Note that the denominators are contained in {1, m, nin 2 }. 

Remark 5.3. If q divides fz g +\, then both V and $((#) — (oo)) 2 depend on where 
oo intersects the special fiber; moreover, W depends on the horizontal part of 
div(wg-i). Hence none of the summands in (5.2) can be read off purely from the 
reduction type and the component that x intersects. Nevertheless, for a given 
curve X the computation of all possible values of p on p-integral points is not more 
difficult using the techniques presented in Subsection 5.1 . 



In this section we sketch how one may use the techniques of this paper to obtain 
a bound on the number of p-integral points on a hyperelliptic curve. We hope that 
these techniques will eventually lead to an effective bound that will only depend on 
the genus and the bad reduction types. 

The following is a trivial extension of Lemma 2 in [Col85a]. 

Lemma 6.1. Let f(t) = a n t n be a power series with coefficients in C p . Let j3 be 
the function whose graph is the bottom of the Newton polygon for f, and let k > 
be an integer. Then, for any s > we have 

#{z G C p , v(z) > s, f{z) — 0} < max{n > k, v(au) — s(n — k) > (3(n)}, 

where v is the p-adic valuation normalized so that v(p) = 1. 

Proof. Suppose that there are m zeros of / with valuation > s. Since f3(k) < v(ak) 
it suffices to show that f3(m) > f3(k) — s(m — k). But this is clear, as by assumption, 
by the properties of the Newton polygon, the slopes of (3 up to the point where 
x = m are at most — s. □ 

Obviously, in the above lemma, any lower bound for j3 may be used instead of 
(3 to obtain an upper bound on the number of zeros. To use this in our setting, we 
need the following obvious lemma. 

Lemma 6.2. Suppose, under the assumptions of the above lemma, that f is an 
r-fold iterated integral of forms u>i,...,u) r which have integral coefficients, and that 
furthermore the constants of integration at each integration are integral as well. 
Then we have the lower bound P(k) > — rlog p (fc). 

In [Col85a] , Coleman uses the following corollary to get an upper bound on the 
number of rational points. 



and 



(5.6) 




*((a:) - (oo)) 2 + (x . V) =0. 



6. Bounds on the number of p-integral points 
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Corollary 6.3. Suppose under the assumptions of Lemma 6.1 that f has integral 
coefficients and that its reduction has order k. Then, 

#{z G C p , v(z) > s, f(z) = 0} < max{n > k, s(n - k) < [log p (n)]} . 

Coleman then uses a bound on the sum of all possible fc's in the above corollary, 
over all power series occurring as local power series expansions of a Coleman inte- 
gral, to obtain a global bound on the number of rational points, when Chabauty's 
method applies, which depends only on the genus and the number of points in the 
reduction of the curve. 

As an application of our techniques, we give a fairly easily computed bound on 
the number of integral points. The reader will recognize that one may improve the 
bound and remove restrictions at the cost of making the bound depend on more 
complicated data. 

Specializing Lemmas 6.1 and 6.2 to the case r = 2, k = s = 1, we get that for 
power series satisfying the conditions of Lemma 6.2, the number of pZ p roots is 
bounded from above by 

(6.1) T(u(ai)) , with T(m) = max{n > 1, m — (n — 1) > — 2 log p (n)} . 

We note, for ease of application that 

(1) if m + 3 < p, then T(m) = m + 2. 

(2) if m + 5 < p 2 , then T(m) = m + 4. 

Theorem 6.4. Suppose, under the assumptions of Theorem 3.1, that the polyno- 
mial f has no roots modulo p. Let dij be the constants appearing in the theorem 
and let T be the function from (6.1). 

Let P. m be 7L v -points of X lifting all 1/p points of the special fiber save oo, and 
for each m, let t m be a local parameter at P m , whose reduction modulo p is also a 
parameter. Define constants (3i m , ji m , ji m by 

ftim — ~TT {Pm) 

and 

r-P m 



f ^ - _ f 

"fim / Ldi 7 'Jim / • 

J to J to 

Let L = max{0, — v(ctij), < i, j < g — 1} and let 

9-1 

5m = 2 

i—0 i<j 

Then, the number of integral solutions to y 2 = f(x) is bounded from above by 

(i + nK-i))E T M<W + £) 

q m 

where, as in Remark 3.2, m q denotes the number of multiplicity 1 components at 
the fiber above q. 

Proof. Note that the combined assumption that we are looking at integral, rather 
than p-integral points, and that / has no roots modulo p, means that we only need 
to consider non-Weierstrass residue disks. Multiplying by p L clears denominators 
from the and it is therefore evident that the expansion of p L p around P m 
satisfies the condition of Lemma 6.2, except possibly that the constant coefficient 
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of the power series is not integral. The first coefficient is exactly p L 5 m . Thus, to 
conclude the proof by Theorem 3.1, Lemma 6.1, Lemma 6.2, and Remark 3.2, it 
suffices to note that having a non-integral value for the constant coefficient of the 
power series flattens the Newton polygon and so improves the bound on the number 
of solutions. □ 

Remark 6.5. To use the theorem above to compute a bound on the number of 
integral points, one needs the following: 

• Computation of (non-iterated) Coleman integrals. These are required for 
the computation of 7^ , 7^, and ay. 

• A basis of J(Q) <g> Q. 

• Computation of p-adic height pairings [BB12, BMS12]. 

In particular, explicit iterated Coleman integrals are not required. One may use 
them, however, or the techniques of Section 4, to get the values of r at the points 
P m , and these may be used to give a more precise bound, typically with more work. 

At the moment, we are unfortunately unable to produce a Coleman-Chabauty- 
like bound, because we are unable to control a quantity like the sum of all fc's in 
Coleman's case for iterated Coleman integrals. We plan to return to this question 
in future work. 

Remark 6.6. We may observe that we can treat primes in the denominators of 
our points, at the cost of extending the set T, as long as we have a bound on the 
denominator. 

7. Examples 

Here we give some examples illustrating the techniques described in this paper. 

7.1. Genus 1. Let X : y 2 = x 3 — 3024a; + 70416, which has minimal model with 
Cremona label "57al". We have that X(Q) has Mordell-Weil rank 1, with P = 
(60, —324) a generator of the free part of the Mordell-Weil group. We take as our 
working prime p — 7. Since the given equation of X is not a minimal Weierstrass 
equation, Remark 3.5 does not apply. The only primes q where the Zariski closure 
of X over Spec(Z 9 ) is not already regular are q = 2, 3. Using Magma [BCP97], we 
compute a dcsingularization X of the Zariski closure of X in the strong sense. The 
special fibers are of the form 

X-2 = T2,0 + T2,l + 2r2,2 

and 

^3 = L3,o + L3,i + 1^3, 2 + 2L 3j 3, 
where all components Y q ^ have genus and L g .o is the component which 00 inter- 
sects. The intersection matrices are given by: 

-2 

l/o=| 0-2 1 I . \l. 

Writing div(w) = V 2 + V3, we have 

(7.1) V 2 = -r 2 ,i + a 2 ,2^2,2 and V3 = -r 3> i - r 3i2 + 03,3^,3, 

computed using the method alluded to in Section 5. 



/ 


-2 








1 \ 







-4 


2 


1 







2 


-2 





V 


1 


1 





1 1 
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Here <22,2 and 03,3 are irrelevant, since no Z 9 -section can intersect a component 
of multiplicity at least 2. 

7.1.1. Computing local and global p-adic heights of a Mordell-Weil generator. Using 
Sage [S + 13], we compute the local height above 7 using Coleman integration and 
cotangent vectors: 

h 7 (P -wP,P- wP) = 4 • 7 + 4 • 7 2 + 7 3 + 7 4 + 6 • 7 5 + 6 • 7 6 + C>(7 7 ), 

and we can compute the contributions away from p = 7 using (7.1) and (5.3) to 
deduce the global 7-adic height: 

h(P -wP,P- wP) =h 7 (P- wP, P - wP) - 2 log 7 (2) - 2 log 7 (3) 

= 6 • 7 + 5 • 7 2 + 4 • 7 3 + 4 • 7 5 + 7 6 + C>(7 7 ), 

which one can check agrees with Harvey's implementation [Har08] of Mazur-Stein- 
Tate's algorithm [MST06] for h(2P). 
This gives 

r(P) = 1 log 7 (4y(P) 2 ) + h(P -wP,P- wP) 

= 4 • 7 2 + 4 • 7 3 + 5 • 7 4 + 3 • 7 5 + 6 • 7 6 + 0(7 7 ). 

Since r(P) is the local component of the 7-adic height above 7 (that is, by 
definition, we have t(P) = h 7 (P — 00, P — 00)), we can put this together with the 
contributions away from p = 7 to compute the global 7-adic height: 

h(P- 00) = /i(P- 00, P- 00) = t(P) -21og 7 (2) - ^log 7 (3) 

= 5 • 7 + 6 • 7 2 + 2 • 7 3 + 5 • 7 4 + 2 • 7 5 + 0(7 7 ). 

7.1.2. Computing a 00 . We have 

h(P- 00) 

This gives 

p(P) = r(P) - a 00 



(/» w o) 



2 • 




= 2 • 7 + 4 • 7 2 + T + 7 5 + 6 • T + 0{T). 

7.1.3. p(z) values. By (3.2), we can combine (5.3) and (7.1) to compute the set T 
of all possible values of p(z) for 7-integral points z on X. It turns out that we have 

T = {i ■ log 7 (2) + j ■ log 7 (3) : * = 0, 2, j = 0, 2, 5/2} . 

We compute the values of 

p(z) = t(z) - a o ^ 

for the sixteen integral points: 

(-48, ±324), (-12, ±324), (24, ±108), (33, ±81), 
(40, ±116), (60, ±324), (132, ±1404), (384, ±7452), 

and find that they all lie in the set T, as summarized below. 
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z 


p(z) 


(-48, ±324) 
(-12, ±324) 
(24, ±108) 
(33, ±81) 
(40, ±116) 
(60, ±324) 
(132, ±1404) 
(384, ±7452) 


21og 7 (2) + |log 7 (3) 
21og 7 (2) + 21og 7 (3) 
21og 7 (2) + 21og 7 (3) 

|log 7 (3) 

21og 7 (2) 
21og 7 (2) + |log 7 (3) 
21og 7 (2) + 21og 7 (3) 
21og 7 (2) + |log 7 (3) 



Remark 7.1. Note that this is recovering the result in [BDCKW12, p. 23] de- 
scribing the set X(7jp)2 of weakly global points of level 2 for a semistable rank 
1 elliptic curve, up to normalization. Our p(z) values, calculated on a minimal 
model, are twice the corresponding values of \\w\\. The reason is that our normal- 
ization of the p-adic height corresponds to the normalization in [Har08], which is 
2p times the normalization in [MST06]. One the other hand, the normalization 
used in [BDCKW12] is p times the normalization in [MST06]. One can also check 
directly that our formulas for the local contributions are twice the formulas for the 
local contributions in [BDCKW12]. 

7.2. Genus 2. Let X : y 2 = /(x), where f(x) = x 3 (x - l) 2 + 1. Using a 2-descent 
as implemented in Magma, one checks that the Jacobian J of X has Mordell-Weil 
rank 2 over Q. We are grateful to Michael Stoll for checking that X has only the 
six obvious integral points (2, ±3), (1, ±1), (0, ±1), using the methods of [BMS+08]. 
Let P = (2, -3), Q = (1, -1),R = (0, 1) e J(Q). Then [(P) - (oo)] and [(Q) - (R)} 
are generators of the free part of the Mordell-Weil group. We take as our working 
prime p = 1 1 . 

The only prime q where the Zariski closure of X over Spec(Z g ) is not already 
regular is q — 2. Using Magma, we find that there is a desingularization of the 
Zariski closure of X in the strong sense such that 

x 2 = r + r 1 + r 2 + r 3 , 

where all components Tj have genus 0. Since the intersection matrix is 

4 2 1 1 \ 
2-200 
10-21' 
1 1 -2 / 

this implies that div(w) = 2oo has no vertical component. Note that Tq and Ti 
have an intersection point of multiplicity 2 and To, T2, T% intersect in one point, so 
X is not semistable. 

7.2.1. Heights of Mordell-Weil generators. Let £>i = (P)-(oo) and D 2 = (Q)-(R). 
Using Sage, we compute the local 11-adic height above 11 as 

h 11 (D 1 ,D 1 ) = 1 lo gll (4y(P) 2 ) + \h n ({P) - (wP), (P) - (wP)) 

= 10 • 11 + 2 • ll 2 + 9 • ll 3 + 9 • ll 4 + 8 • ll 5 + 0(11 7 ). 



M 
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Away from p = 11, h{D\, D\) only has a contribution at 2 (computed using (5.3)), 
and we have 

h(D 1 ,D 1 ) = -| Iog n (2) + ftn(Di, Di) 

= 7 • 11 + ll 2 + 8 • ll 3 + 8 • ll 4 + 7 • ll 5 + 2 • ll 6 + 0(11 7 ). 

Using 2 (P)-(S)-(wS) ~ 2(P)-2(oo), where S = (-2/9, -723/9 3 ), we compute 

hu(D u D 2 ) = \ (h n ((P) - (S), (Q) - (R)) - h u ((P) - (wS), (wQ) - (wR))) 
= 4 • 11 + 4 • ll 2 + 8 • ll 3 + 4 • ll 4 + 9 • ll 5 + <3(11 7 ) 

and 

h(D 1 ,D 2 ) = \ (h((P) - (S), (Q) - (R)) - h((P) - (wS), (wQ) - (wR))) 

= \ (hn((P) (S), (Q) - (R)) (-± Io gll (2) + h n ((P) (wS), (wQ) (wR)) 

= 3 • 11 + 10 • ll 2 + 9 • ll 3 + 6 • ll 4 + 4 • ll 5 + 4 • ll 6 + 0(11 7 ). 
Finally, we have 

hu(D 2 ,D 2 ) = hu((Q) - {R), (wR) - (wQ)) 

= 3 • 11 + 4 • ll 2 + 4 • ll 3 + ll 4 + ll 5 + 3 • ll 6 + 0(11 7 ) 

and 

h(D 2 ,D 2 ) = llog n {2) + h 11 {D 2 ,D 2 ) 

= 8 • 11 + 2 • ll 4 + 6 • ll 5 + 6 • ll 6 + 0(11 7 ). 

Note that for h(Di,D 2 ) and h(D 2 ,D 2 ) our chosen representatives are disjoint, so 
we can use the techniques of [BB12, BMS12]. 

7.2.2. The coefficients otij. We compute the coefficients using the matrix of the 
various products of Coleman integrals evaluated at D\ 1 D 2 (as in [BBK10]) and the 
global 11-adic heights computed above: 

I Dl w o J Dl w J Dl w J Di wi J Di wi J Di wi \ _1 / ft(Di, Di) 
Id^oJd^o I Dl ^oI D2 ^i Jc^i^wi h{D u D 2 ) 
Id 2 w o Jo 2 w J D2 w J D2 wi / Da wi J Da wi / V MAs , D 2 ) 

This gives 

a 00 = 8 • ll" 1 + 8 + 5 • 11 + 3 • ll 2 + 2 • ll 3 + 9 • ll 4 + 10 • ll 5 + 6 • ll 6 + 9 • ll 7 + 0(11 9 ), 

aoi = 6 • ll" 1 + 3 + 10 • 11 + 3 • ll 2 + 10 • ll 3 + 3 • ll 4 + 3 • ll 5 + 9 • ll 6 + 2 • ll 7 + ll 8 + <3(11 9 ), 

an = 8 • ll" 1 + 8 + 3 • 11 + 10 • ll 2 + 4 • ll 3 + 8 • ll 4 + 10 • ll 5 + 3 • ll 6 + 7 • ll 7 + 9 • ll 8 + 0(11 9 ). 

7.2.3. Values of p on 11 -integral points. Since div(w) is horizontal and the polyno- 
mial / is monic, we only need (5.3) to compute the set 

r=|o,llog 11 (2),Hlo gll (2)| 

of all possible values of p on 11-intcgral points. 



/ "00 




1 aoi 


■( 


\ an 
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7.2.4. Constructing the dual basis. Now we describe how to construct power series 
whose zeros contain the set of integral points of X. For this, we will need to use the 
expression for r given in Theorem 2.1. (Note that thus far, we have only used the 
expression for r as a local height, which does not give a power series expansion.) 

First, we must compute the dual basis for {w ,uii} in W. To make the calcula- 
tions compatible with previous height computations [BB12], we take W to be the 
unit root subspace. 

Recall that a basis for W is given by {(</>* ) n u>2, {<P*) n ^3}, where is a lift of 
p-power Frobenius and n is the working precision. Thus we let uij, for j = 2,3, 
be the projection of uij on the unit root subspace along the space of holomorphic 
forms. We clearly have [uij] U [u>j] = [uij] U [w,] for i < 1 < j. 

Here is the cup product matrix for X: 



( 



\ 



I 

3 



where the ijth entry is giving the value of [u>i] U [uij]. The inverse of the bottom 
left corner (which gives the above cup product values [uij] U [wj]) is 

4 -3 



-1 







and this immediately gives 



Ul a = 4(D 2 - 



(note that this is consistent with (2.4)). 

We now need to compute the projection of UJ2,^3 with respect to the basis 

{U0,W1,W2,W3}. 

We do this by inverting the matrix with first two columns from the identity 
matrix and last two columns from the Frobenius matrix raised to the working 
precision. Calling the resulting upper-right submatrix 

a b 
c d 



we find 



3 + io-ii + io-ir 



ii 4 



b = 6 + 8 • 11 
c = 4 + 3- 11 
d = 6 + ll 2 + 9 • 
so that we have 



which gives 



4- IV 



ll- 1 



9- ir 



6-lr + 6- IT' 
ll 3 + 5 • 11^ 



+ 7- 

UJ 3 



ir + 5- n 

9 • 11 
ll 5 



11' 
-2- 



IT 



IT 
-6- 



11' 



IT 
-2- 



5- ll 1 



0(11' 



IT 



5- ll 9 + 6- ll iu + 0(ll il ) 



10- IT 



4 • ll 6 + - 



4- ll b H 
ll 8 + 2 



5 • 11 + 2 • 11 + 2 • ll 9 + 0(11^) 
ll 10 + O(ll n ) 



UJ 3 



buj - 



- buii 
duii 



ui = (-4a + 3b)u>o + (-4c + 3d)uii + 4ui 2 - 3ui 3 
cJi = auiQ + cuii — ui2- 
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One can check that this gives [u>i] U [u>j] — Sij- 

Now we use the dual basis to construct power series within each of the following 
Fn-residue disks: 

{ (07±T) , (lT±T) , (2~±3) , (8~±3) , (4~±4) , (6~0) } . 

For example, starting with the residue disk of (0,1), we first lift (0,1) to the 
point P = (0, 1). We compute t(P) using its interpretation as the p-component of 
the global p-adic height pairing of P — oo: 

t(P) = 3-ll+2-ll 2 +ll 3 +4-ll 4 +2-ll 5 +9-ll 6 +8-ll 7 +6-ll 8 +9-ll 9 +7-ll 10 +O(ll 11 ). 
Using this, we write down a power series expansion for p in the disk of (0, 1): 




Setting this equal to each of the values in the set T = {0, | log n (2), | log 11 (2)}, we 
find that the points z in the residue disk (0,1) with x-coordinates 

{O(ll 11 ),4-ll+7-ll 2 +9-ll 3 +7-ll 4 +9-ll 6 +8-ll 7 + ll 8 +4-ll 9 + 10-ll 10 +O(ll 11 )} 

are the only Zn-points z which achieve p(z) values in the set T. 

This computation recovers the integral point (0, 1) and tells us that no other 
integral points in the disk exist with ^-coordinate less than ll 11 . Here we summarize 
the computation in the other residue disks: 



disk 


x{z) 


p(z) 


(0,±1) 


4 • 11 + 7 • ll 2 + 9 • ll 3 + 7 • ll 4 + 9 • ll b + 0(H Y ) 

o(n 7 ) 


|logu(2) 
|logu(2) 


(1,±1) 


1 + 3 • 11 + 3 • ll 2 + 4 • ll 3 + 9 • ll 4 + 3 • ll 5 + 10 • ll 6 + 0(11 7 ) 

1 + 0(11 7 ) 

1 + 10 • 11 + 6 • ll 2 + ll 3 + 2 • ll 4 + 7 • ll 5 + 4 • ll 6 + 0(11 7 ) 




|logu(2) 
|logn(2) 


(2, ±3) 


2 + 7 • 11 + 2 • ll 2 + 2 • ll 3 + 7 • ll 5 + 10 • ll b + 0(11 7 ) 
2 + 10 • 11 + 9 • ll 2 + ll 3 + 3 • ll 4 + 6 • ll 6 + 0(11 7 ) 

2 + (9(ll 7 ) 

2 + 6 • 11 + 2 • ll 2 + 4 • ll 3 + 4 • ll 4 + 6 • ll 5 + 4 • ll 6 + (9(11 7 ) 





|logu(2) 
!logu(2) 


(8, ±3) 


8 + 7 • ll 2 + 7 • ll 3 + 4 • ll 4 + ll 5 + 8 • 11 B + 0(11 7 ) 
8 + 4 • 11 + 3 • ll 2 + 7 • ll 3 + 2 • ll 4 + 3 • ll 5 + 2 • ll 6 + (9(11 7 ) 
8 + 10 • 11 + 4 • ll 2 + 8 • ll 3 + 9 • ll 4 + 10 • ll 5 + 2 • ll 6 + (9(11 7 ) 
8 + 5 • 11 + 2 • ll 2 + 8 • ll 3 + 4 • ll 4 + 10 • ll 5 + 10 • ll 6 + 0(11 7 ) 





|logn(2) 
flogn(2) 


(4, ±4) 


4 + 11 + 7 • ll 3 + ll 4 + 8 • ll 5 + ll e + 0(11 7 ) 
4 + 3 • ll 2 + 7 • ll 3 + ll 4 + 8 • ll 5 + 8 • ll 6 + (9(11 7 ) 


5 lo gn(2) 
|logu(2) 


(6,0) 


6 + 9 • 11 + 6 • ll 2 + 7 • ll 3 + 2 • ll 4 + 9 • ll 5 + 0(11 7 ) 


5logn(2) 



In particular, here are the recovered integral points and their corresponding p 
values: 



z 


P( z ) 


(2, ±3) 

(1,±1) 
(0,±1) 


flogii(2) 
|logii(2) 
Ilogii(2) 
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